accelerate >= 0.12.0
seqeval
datasets >= 1.8.0
torch >= 1.3
evaluate